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1 Data mining: Mining the peanut gallery: opinion extraction and semantic classification 
of products 

Kushal Dave, Steve Lawrence, David M. Pennock 

May 2003 Proceedings of the 12th international conference on World Wide Web 



Full text available: f |pdr(327. 95 KB) 



Additional Information: full citation , abstract, references , citings , index 
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The web contains a wealth of product reviews, but sifting through them is a daunting task. 
Ideally, an opinion mining tool would process a set of search results for a given item, 
generating a list of product attributes (quality, features, etc.) and aggregating opinions 
about each of them (poor, mixed, good). We begin by identifying the unique properties of 
this problem and develop a method for automatically distinguishing between positive and 
negative reviews. Our classifier draws on information ... 

Keywords: document classification, opinion mining 
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Vasileios Hatzivassiloglou, Janyce M. Wiebe 

July 2000 Proceedings of the 17th conference on Computational linguistics - Volume 1 

Full text available: ^ pdff692.97 KB) Additional Information: full citation , abstract , references , citings 

Subjectivity is a pragmatic, sentence-level feature that has important implications for text 
processing applications such as information extraction and information retrieval. We study 
the effects of dynamic adjectives, semantically oriented adjectives, and gradable adjectives 
on a simple subjectivity classifier, and establish that they are strong predictors of 
subjectivity. A novel trainable method that statistically combines two indicators of 
gradability is presented and evaluated, complementin ... 

3 Measuringpraise.and 

Peter D. Turney, Michael L. Littman 

October 2003 ACM Transactions on Information Systems (TOIS), volume 21 issue 4 
Full text available: |j | .odf(640.81 KB) Additional Information: full citation, abstract , references, index terms 

The evaluative character of a word is called its semantic orientation. Positive semantic 
orientation indicates praise (e.g., "honest", "intrepid") and negative semantic orientation 
indicates criticism (e.g., "disturbing", "superfluous"). Semantic orientation varies in both 
direction (positive or negative) and degree (mild to strong). An automated system for 
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measuring semantic orientation would have application in text classification, text filtering, 
tracking opinions in online discussions ... 

Keywords: latent semantic analysis, mutual information, semantic association, semantic 
orientation, text classification, text mining, unsupervised learning, web mining 



4 WoM.sense„disam 
Ido Dagan, Alon Itai 

December 1994 Computational Linguistics, volume 20 issue 4 

Full text available:^* # _ (3 

^pdfC2 ; 5ZMBl™ Additional Information: Ml citation, abstract, references, citings 
Publisher Site 

This paper presents a new approach for resolving lexical ambiguities in one language using 
statistical 'data from a monolingual corpus of another language. This approach exploits the 
differences between mappings of words to senses in different languages. The paper 
concentrates on the problem of target word selection in machine translation, for which the 
approach is directly applicable. The presented algorithm identifies syntactic relations 
between words, using a source language parser, and maps t ... 
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EyaJuMlOS.^ 

Nancy Chinchor, David D. Lewis, Lynette Hirschman 
September 1993 Computational Linguistics, Volume 19 Issue 3 

Full text available:,™ -^ OMD ,i| 

T^Satto.00 .sylBA seh Additional Information: MLcMtion, abstract, references, citings 
Publisher Site 

This paper describes and analyzes the results of the Third Message Understanding 
Conference (MUC-3). It reviews the purpose, history, and methodology of the conference, 
summarizes the participating systems, discusses issues of measuring system effectiveness, 
describes the linguistic phenomena tests, and provides a critical look at the evaluation in 
terms of the lessons learned. One of the common problems with evaluations is that the 
statistical significance of the results is unknown. In the disc ... 

iDformaMaM 

Ellen Riloff, Wendy Lehnert 

July 1994 ACM Transactions on Information Systems (TOIS), volume 12 issue 3 

Full text available: f | pdff2.79 MB) Additional Information: M citation, abstract, references, citings, index 

terms, jgyj&tf 

We describe an approach to text classification that represents a compromise between 
traditional word-based techniques and in-depth natural language processing. Our approach 
uses a natural language processing task called "information extraction" as a basis for high- 
precision text classification. We present three algorithms that use varying amounts of 
extracted information to classify texts. The relevancy signatures algorithm uses linguistic 
phrases; the a ... 

Keywords: information extraction, text classification 
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Benjamin L. Chen, Von-Wun Soo 

August 1992 Proceedings of the 14th conference on Computational linguistics - Volume 
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Full text available: ^pdf(M2-J.§.KBj Additional Information: full citation, abstract, references 

Anaphoric reference is an important linguistic phenomenon to understand the discourse 
structure and content. In Chinese natural language processing, there are both the problems 
of choosing and resolving anaphora. In Mandarin Chinese, several linguists have attempted 
to propose criteria to explain the phenomenon of anaphora but with controversial results. 
On the other hand, search-based computational techniques for resolving anaphora are 
neither the best way to resolve Chinese anaphora nor to fac ... 

Keywords: anaphoric reference, case based learning, natural language acquisition, 
semantic roles (case) 



8 Automatic verb classification based on statistical distributions of argument structure 
Paola Merlo, Suzanne Stevenson 

September 2001 Computational Linguisti CS, Volume 27 Issue 3 

Full text available:^ _. (f| 

^patu,MMp.L^ Additional Information: Ml citation, abstract, references, citings 
Publisher Site 

Automatic acquisition of lexical knowledge is critical to a wide range of natural language 
processing tasks. Especially important is knowledge about verbs, which are the primary 
source of relational information in a sentence— the predicate-argument structure that 
relates an action or state to its participants (i.e., who did what to whom). In this work, we 
report on supervised learning experiments to automatically classify three major types of 
English verbs, based on their argument structure— sp ... 

9 Fjexibie..parsing..of dlscr^ely.ufe 
Luigi Borghesi, Chiara Favareto 

July 1982 Proceedings of the 9th conference on Computational linguistics - Volume 1 

Full text available: ■ gpdf(2S5.46 KB) Additional Information: full citation , abstract , references 

In this paper we describe a syntactic semantic parser of spoken sentences pertaining to a 
subset of natural Italian language. Error-free and fast analysis, partial interpretation ability, 
man-machine dialogue trend, different semantic environment adaptability and natural 
language usage are its main characteristics. All of these features are supported by a 
technique of input reliability evaluation. Particular attention is devoted to the description of 
the knowledge internal representation and of ... 

10 Automatic semantic 

implemented classifier for stativity 
Michael R. Brent 

April 1991 Proceedings of the fifth conference on European chapter of the Association 
for Computational Linguistics 

Full text available: MpdR.4m23.KB) 

If Additional Information: full citation , abstract , references , citings 

This paper discusses an implemented program that automatically classifies verbs into those 
that describe only states of the world, such as to know, and those that describe events, 
such as to look. It works by exploiting the constraint between the syntactic environments in 
which a verb can occur and its meaning. The only input is on-line text. This demonstrates 
an important new technique for the automatic generation of lexical databases. 

domajn..que^ 

Hui Yang, Tat-Seng Chua, Shuguang Wang, Chun-Keat Koh 
July 2003 
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Proceedings of the 26th annual international ACM SIGIR conference on 
Research and development in informaion retrieval 

Full text available: , P |pdff287.93 KB? Additional Information: Miration, abstract, refererices, citings, index 
' ~* terms 

One of the major problems in question answering (QA) is that the queries are either too 
brief or often do not contain most relevant terms in the target corpus. In order to overcome 
this problem, our earlier work integrates external knowledge extracted from the Web and 
WordNet to perform Event-based QA on the TREC-11 task. This paper extends our approach 
to perform event-based QA by uncovering the structure within the external knowledge. The 
knowledge structure loosely models different facets of ... 

Keywords: event-based QA, query formulation, question answering 



12 Special issue: Ai in engineering j 
D. Sriram, R. Joobbani 

January 1985 ACM SIGART Bulletin; issue 91 

Full text available: ^.pdf{8,79 MB) Additional Information: MLcitatipn, abstract 

The papers in this special issue were compiled from responses to the announcement in the 
July 1984 issue of the SIGART newsletter and notices posted over the ARPAnet. The interest 
being shown in this area is reflected in the sixty papers received from over six countries. 
About half the papers were received over the computer network. 

13 Session 5: Paradoxes and semantic representation j 
Richmond H. Thomason 

March 1986 Proceedings of the 1986 conference on Theoretical aspects of reasoning 
about knowledge 

Full text available: ^.pdf(i..24 MB) Additional Information: ML citation, abstract, references 

Many researchers in Computer Science, Linguistics, Logic, and Philosophy have been 
discovering in various ways that analogues of the Liar paradox pose deep foundational 
problems for intensional semantics. In this survey paper I state my own view of what the 
problems are, and try to provide a broad perspective on the issues, with many references 
to the literature. Although the paper concentrates on developments in Philosophical Logic, I 
hope that the paper will help all researchers concerned w ... 

14 Knowledge representation for commonsense reasoning with text | 
Kathleen Dahlgren, Joyce McDowell, Edward P. Stabler 

September 1989 Computational Linguistics, volume 15 issue 3 

Full text available: g] 

HI pent Mb; ^ Additional Information: full citation, references, citings 
P.ybJi§h§r.Sjt§. 
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January 1987 Computational Linguistics, Volume 13 Issue 1-2 

Full text available:^ SI 
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Publisher Site 
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Ronald J. Brachman, Brian C. Smith 
February 1980 ACM SIGART Bulletin, issue 70 

Full text available: ^]pdf{13.13 MB) Additional Information: Ml citation, abstract 

In the fall of 1978 we decided to produce a special issue of the SIGART Newsletter devoted 
to a survey of current knowledge representation research. We felt that there were twe 
useful functions such an issue could serve. First, we hoped to elicit a clear picture of how 
people working in this subdiscipline understand knowledge representation research, to 
illuminate the issues on which current research is focused, and to catalogue what 
approaches and techniques are currently being developed. Secon ... 



17 Learning methods to combine linguistic indicators: improving aspectual classification 
and reyeafe^ 
Eric V. Siegel, Kathleen R. McKeown 

December 2000 Computational Linguistics, Volume 26 issue 4 

Full text available: = re ><rv (fffl 

H ]pvA1.95 sv1B;* 0! Additional Information: full citation , abstract , references 
EufeJjsher Site 

Aspectual classification maps verbs to a small set of primitive categories in order to reason 
about time. This classification is necessary for interpreting temporal modifiers and assessing 
temporal relationships, and is therefore a required component for many natural language 
applications.A verb's aspectual category can be predicted by co-occurrence frequencies 
between the verb and certain linguistic modifiers. These frequency measures, called 
linguistic indicators, are chosen by linguistic insi ... 

18 Sources of Success for Boosted Wrapper Induction 
David Kauchak, Joseph Smarr, Charles Elkan 

December 2004 The Journal of Machine Learning Research, volume 5 

Full text available: ^.p.dft281.4§. KB) Additional Information: MlQitatjon, abstract, index terms 

In this paper, we examine an important recent rule-based information extraction (IE) 
technique named Boosted Wrapper Induction (BWI) by conducting experiments on a wider 
variety of tasks than previously studied, including tasks using several collections of natural 
text documents. We investigate systematically how each algorithmic component of BWI, in 
particular boosting, contributes to its success. We show that the benefit of boosting arises 
from the ability to reweight examples to learn specifi ... 

1 9 Dlscovering.ali. most. specif Ic.^ntences 
Dimitrios Gunopulos, Roni Khardon, Heikki Mannila, Sanjeev Saluja, Hannu Toivonen, Ram 
Sewak Sharma 

June 2003 ACM Transactions on Database Systems (TODS), Volume 28 Issue 2 

Full text available- f ?\ pdf(^3 09 KB) Additiona{ Information: full citation , abstract , references , citings , index 
' ' * terms 

Data mining can be viewed, in many instances, as the task of computing a representation of 
a theory of a model or a database, in particular by finding a set of maximally specific 
sentences satisfying some property. We prove some hardness results that rule out simple 
approaches to solving the problem.The a priori algorithm is an algorithm that has been 
successfully applied to many instances of the problem. We analyze this algorithm, and 
prove that is optimal when the maximally specific sen ... 

Keywords: Data mining, association rules, learning with membership queries, maximal 
frequent sets, minimal keys 
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Manuel J. Mafia-Lopez, Manuel De Buenaga, Jose M. Gomez-Hidalgo 

April 2004 ACM Transactions on Information Systems (TOIS), volume 22 issue 2 

Full text available- jj hxjgigg 31 KB) Additiona! information: fall citation , abstract , references, index terms. 
' S review 

A more and more generalized problem in effective information access is the presence in the 
same corpus of multiple documents that contain similar information. Generally, users may 
be interested in locating, for a topic addressed by a group of similar documents, one or 
several particular aspects. This kind of task, called instance or aspectual retrieval, has been 
explored in several TREC Interactive Tracks. In this article, we propose in addition to the 
classification capacity of clustering techn ... 

Keywords: Multidocument summarization, topic segmentation 
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21 The Hearsay-ll Speech-Understanding System: integrating Knowledge to Resolve 
Uncertainty. 

Lee D. Erman, Frederick Hayes-Roth, Victor R. Lesser, D. Raj Reddy 
June 1980 ACM Computing Surveys (CSUR), volume 12 issue 2 

Full text available: fl ] >xif{3.63 MB) Additional Information: full citation , references , citings , index terms 



22 SppMn 

Michael F. McTear 

March 2002 ACM Computing Surveys (CSUR), Volume 34 issue l 

Full text available- fl|pjjfi8fi7 §S KB! Additional Information: full citation , abstract , references , citings , index 



Spoken dialogue systems allow users to interact with computer-based applications such as 
databases and expert systems by using natural spoken language. The origins of spoken 
dialogue systems can be traced back to Artificial Intelligence research in the 1950s 
concerned with developing conversational interfaces. However, it is only within the last 
decade or so, with major advances in speech technology, that large-scale working systems 
have been developed and, in some cases, introduced into commerc ... 

Keywords: Dialogue management, human computer interaction, language generation, 
language understanding, speech recognition, speech synthesis 
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language understanding 
Frederique Segond, Karen Jensen 

August 1992 Proceedings of the 14th conference on Computational linguistics - Volume 
3 

Full text available: ^pdf(499.02.KB) Additional Information: full citation, references 
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March 1998 Computational Linguistics, volume 24 issue l 
Full text available 



^jpdf{1.35 MB) W$ Additional Information: full citation , abstract, references , citings 
PyMsher.Sjte 

Corpus-based approaches to word sense identification have flexibility and generality but 
suffer from a knowledge acquisition bottleneck. We show how knowledge-based techniques 
can be used to open the bottleneck by automatically locating training corpora. We describe 
a statistical classifier that combines topical context with local cues to identify a word sense. 
The classifier is used to disambiguate a noun, a verb, and an adjective. A knowledge base 
in the form of WordNet's lexical relations is ... 

25 Extracting^ 

associations: a semantic approach 

Shian-Hua Lin, Chi-Sheng Shih, Meng Chang Chen, Jan-Ming Ho, Ming-Tat Ko, Yueh-Ming 
Huang 

August 1998 Proceedings of the 21st annual international ACM SIGIR conference on 
Research and development in information retrieval 

Full text available: S.P.dfil.02. MB) Additional Information: Ml citation, references, citings, intextenro 
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Wee Meng Soon, Hwee Tou Ng, Daniel Chung Yong Lim 
December 2001 Computational Linguistics, Volume 27 Issue 4 

Full text available:,™ -omdvIII 

^paftj„.ogjy1Bi.^ Additional Information: fuH.cMtj.Qn, abstract, references, citings 
Publisher Site 

In this paper, we present a learning approach to coreference resolution of noun phrases in 
unrestricted text. The approach learns from a small, annotated corpus and the task includes 
resolving not just a certain type of noun phrase (e.g., pronouns) but rather general noun 
phrases. It also does not restrict the entity types of the noun phrases; that is, coreference 
is assigned whether they are of "organization," "person," or other types. We evaluate our 
approach on common data sets (namely, the M ... 

Joseph A. Goguen, Rod M. Burstall 

January 1992 Journal of the ACM (JACM), volume 39 issue 1 

Additional Information: full citation, abstract , references, citings, index 



Full text available: B m?>dff3.S1 MB) 

terms 

There is a population explosion among the logical systems used in computing science. 
Examples include first-order logic, equational logic, Horn-clause logic, higher-order logic, 
infinitary logic, dynamic logic, intuitionistic logic, order-sorted logic, and temporal logic; 
moreover, there is a tendency for each theorem prover to have its own idiosyncratic logical 
system. The concept of institution is introduced to formalize the informal notion of "logical 
system. &rdquo ... 

28 The interaction of knowledge sources in word sense disambiguation 
Mark Stevenson, Yorick Wilks 

September 2001 Computational Linguistics, Volume 27 Issue 3 

Full text available:,^ . , [H 

T| |pe!f(ZiosVlBj^ Additional Information: full citation, abstract , reference;; 
PubljsherSite 

Word sense disambiguation (WSD) is a computational linguistics task likely to benefit from 
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the tradition of combining different knowledge sources in artificial in telligence research. An 
important step in the exploration of this hypothesis is to determine which linguistic 
knowledge sources are most useful and whether their combination leads to improved 
results. We present a sense tagger which uses several knowledge sources. Tested accuracy 
exceeds 94% on our evaluation corpus. Our system attempts ... 

29 A.stochastic.f inite 

Richard Sproat, William Gale, Chilin Shih, Nancy Chang 
September 1996 Computational Linguistics, Volume 22 Issue 3 

Full text available: ~ A jf| 

^fidfUiSlixlB). ™ Additional Information: Ml citation, abstract .references, citings 
Publisher Site 

The initial stage of text analysis for any NLP task usually involves the tokenization of the 
input into words. For languages like English one can assume, to a first approximation, that 
word boundaries are given by whitespace or punctuation. In various Asian languages, 
including Chinese, on the other hand, whitespace is never used to delimit words, so one 
must resort to lexical information to "reconstruct" the word-boundary information. In this 
paper we present a stochastic finite-state model whe ... 



30 Circumscription with homomorohisros: solving the equality and counterexample 
problems 

Peter K. Rathmann, Marianne Winslett, Mark Manasse 
September 1994 Journal of the ACM (JACM), Volume 41 issue 5 

Full text available: ^pdfl4.00 MB) Additional Information: ftili citation, abstract, references, index terms 

One important facet of common-sense reasoning is the ability to draw default conclusions 
about the state of the world, so that one can, for example, assume that a given bird flies in 
the absence of information to the contrary. A deficiency in the circumscriptive approach to 
common-sense reasoning has been its difficulties in producing default that Tweety * Blutto 
using ordinary circumscription, or conclude by default that a particular bird fl ... 

Keywords: circumscription, common sense reasoning 
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Hugo Liu, Henry Lieberman, Ted Selker 

January 2003 Proceedings of the 8th international conference on Intelligent user 
interfaces 

Full text available 1 fl!pdt7234 54 KB) Addltional Information: fuHcitaiipn, abstract, references, citings, index 
[gsj. ...... ........... „ terms 

This paper presents a novel way for assessing the affective qualities of natural language 
and a scenario for its use. Previous approaches to textual affect sensing have employed 
keyword spotting, lexical affinity, statistical methods, and hand-crafted models. This paper 
demonstrates a new approach, using large-scale real-world knowledge about the inherent 
affective nature of everyday situations (such as "getting into a car accident") to classify 
sentences into "basic" emotion categories. This co ... 

Keywords: affective UI, affective computing, commonsense reasoning, emotions, open 
mind commonsense, story understanding 
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May 2005 Proceedings of the 14th international conference on World Wide Web 

Full text available: ^pdfQ32 .66.KBJ Additional Information: full citation, abstract, references, index terms 

The Web has become an excellent source for gathering consumer opinions. There are now 
numerous Web sites containing such opinions, e.g., customer reviews of products, forums, 
discussion groups, and blogs. This paper focuses on online customer reviews of products. It 
makes two contributions. First, it proposes a novel framework for analyzing and comparing 
consumer opinions of competing products. A prototype system called Opinion Observer is 
also implemented. The system is such that with a ... 

Keywords: information extraction, opinion analysis, sentiment analysis, visualization 



33 EaM.detection.of.OT ! 
Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies 
on Collaborative research 

Full text available: pdf(4.21 MB) Additional Information: full citation , abstract , references, index terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on 
process-time diagrams are often used to obtain a better understanding of the execution of 
the application. The visualization tool we use is Poet, an event tracer developed at the 
University of Waterloo. However, these diagrams are often very complex and do not provide 
the user with the desired overview of the application. In our experience, such tools display 
repeated occurrences of non-trivial commun ... 

34 Techpjcal .papers;_softa | 
design to source 

Elisa L. A. Baniassad, Gail C. Murphy, Christa Schwanninger 

May 2003 Proceedings of the 25th International Conference on Software Engineering 

Full text available:^ . 0 (§§ 

^AdUJLs^J^EUL^ Additional Information: full .citation, abstract, references, index ter^s 
Publisher Site 

A developer attempting to evolve a system in which design patterns have been applied can 
benefit from knowing which code implements which design pattern. For instance, the 
developer may be able to understand the purpose, or to assess the flexibility of the code, 
more quickly. The degree to which the developer benefits depends upon their 
understanding of the pattern. Achieving an in-depth understanding of even a simple pattern 
can be difficult as pattern descriptions span several pages of text, a ... 

35 Systems: An architecture for anaphora resolution | 
Elaine Rich, Susann LuperFoy 

February 1988 Proceedings of the second conference on Applied natural language 
processing 

Full text available: ®.pdf(639..69 KB). 

sf Additional Information: full citation , abstract , references, citings 

W publisher Site 

In this paper, we describe the pronominal anaphora resolution module of Lucy, a portable 
English understanding system. The design of this module was motivated by the observation 
that, although there exist many theories of anaphora resolution, no one of these theories is 
complete. Thus we have implemented a blackboard-like architecture in which individual 
partial theories can be encoded as separate modules that can interact to propose candidate 
antecedents and to evaluate each other's proposals. 
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Kazunori Komatani, Tatsuya Kawahara 

July 2000 Proceedings of the 17th conference on Computational linguistics - Volume 1 

Full text available: ^ odf(608.30 KB) Additional Information: full citation, abstract, references, citings 

We present a method to realize flexible mixed-initiative dialogue, in which the system can 
make effective confirmation and guidance using concept-level confidence measures (CMs) 
derived from speech recognizer output in order to handle speech recognition errors. We 
define two concept-level CMs, which are on content-words and on semantic-attributes, 
using 10-best outputs of the speech recognizer and parsing with phrase-level grammars. 
Content-word CM is useful for selecting plausible interpretatio ... 

37 Qyc; : Joward program 

Douglas B. Lenat, R. V. Guha, Karen Pittman, Dexter Pratt, Mary Shepherd 
August 1990 Communications of the ACM. Volume 33 Issue 8 

Full text available- f ^odft3.ftQ VIE; Additional Information: full citation , abstract , references, citings , index 
^ ^ terms, review; 

Cyc is a bold attempt to assemble a massive knowledge base (on the order of 108 axioms) 
spanning human consensus knowledge. This article examines the need for such an 
undertaking and reviews the authos' efforts over the past five years to begin its 
construction. The methodology and history of the project are briefly discussed, followed by 
a more developed treatment of the current state of the representation language used 
(epistemological level), techniques for efficient ... 

38 The correction of ill-formed input using history-based expectation with applications to | 
speech understanding 

Pamela K. Fink, Alan W. Biermann 

January 1986 Computational Linguistics, volume 12 issue 1 

Full text available:^ S 

*g jpar<1. ,4?viB)^ Additional Information: full citation , abstract , references, citings 
E.y.WjshexSite * - 

A method for error correction of ill-formed input is described that acquires dialogue patterns 
in typical usage and uses these patterns to predict new inputs. Error correction is done by 
strongly biasing parsing toward expected meanings unless clear evidence from the input 
shows the current sentence is not expected. A dialogue acquisition and tracking algorithm is 
presented along with a description of its implementation in a voice interactive system. A 
series of tests are described that show the ... 



39 Applications: Menu-based naturai language understanding 

Harry R. Tennant, Kenneth M. Ross, Richard M. Saenz, Craig W. Thompson, James R. Miller 
June 1983 Proceedings of the 21st annual meeting on Association for Computational 
Linguistics 

Full text available: *f| pdf{71 5.59 KB) 

Ms Additional Information: full citation , abstract, references , citings 

wpubjjsher Site 

This paper describes the NLMenu System, a menu-based natural language understanding 
system. Rather than requiring the user to type his input to the system, input to NLMenu is 
made by selecting items from a set of dynamically changing menus. Active menus and 
items are determined by a predictive left-corner parser that accesses a semantic grammar 
and lexicon. The advantage of this approach is that all inputs to the NLMenu System can be 
understood thus giving a 0% failure rate. A companion system t ... 
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Vesper Owei 

July 2002 ACM Transactions on Information Systems (TOIS), Volume 20 Issue 3 
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Missing information, imprecision, inconsistency, vagueness, uncertainty, and ignorance 
abound in information systems. Such imperfection is a fact of life in database systems. 
Although these problems are widely studied in relational database systems, this is not the 
case in conceptual query systems. And yet, concept-based query languages have been 
proposed and some are already commercial products. It is therefore imperative to study 
these problems in concept-based query languages, with a view to ... 

Keywords: ambiguous query, anaphoric query, concept-based query, conceptual query 
language, elliptical query, imperfect queries, incomplete information, inconsistency, 
inexplicit query, missing information, natural language interface, natural language query, 
semantically mismatched query 
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